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SEQUENCE LISTING 

3 (1) GENERAL INFORMATION: 

5 (i) APPLICANT: E . I. DU PONT DE NEMOURS AND 

6 COMPANY 

8 (ii) TITLE OF INVENTION: CHIMERIC GENES AND METHODS FOR 

9 INCREASING THE LYSINE AND 

10 THREONINE CONTENT OF THE SEEDS OF 

11 PLANTS 
13 (iii) NUMBER OF SEQUENCES: 107 

15 (iv) CORRESPONDENCE ADDRESS: 

16 (A) ADDRESSEE: E. I. DU PONT DE NEMOURS 

17 AND COMPANY 

18 (B) STREET: 1007 MARKET STREET 

19 (C) CITY: WILMINGTON 

20 (D) STATE: DELAWARE 

21 (E) COUNTRY: U.S.A. 

22 (F) ZIP: 19898 

24 (V) COMPUTER READABLE FORM: 

25 (A) MEDIUM TYPE: FLOPPY DISK 

26 (B) COMPUTER: IBM PC COMPATIBLE 

27 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

28 (D) SOFTWARE: MICROSOFT WORD VERSION 2 . 0C 
30 (vi) CURRENT APPLICATION DATA: 

C--> 31 (A) APPLICATION NUMBER: US/10/023 , 066A 

C--> 32 (B) FILING DATE: 29-Apr-2002 

33 (C) CLASSIFICATION: 

35 (viii) ATTORNEY/AGENT INFORMATION: 

36 (A) NAME: BARBARA C. SIEGELL 

37 (B) REGISTRATION NUMBER: 30,684 

38 (C) REFERENCE/DOCKET NUMBER: BB-1037-C 

40 (ix) TELECOMMUNICATION INFORMATION: 

41 (A) TELEPHONE: 302-992-4931 

42 (B) TELEFAX: 302-773-0164 

43 (C) TELEX: 835420 

45 (2) INFORMATION FOR SEQ ID NO : 1: 

47 (i) SEQUENCE CHARACTERISTICS: 

48 (A) LENGTH: 1350 base pairs 

49 (B) TYPE: nucleic acid 

50 (C) STRANDEDNESS : single 

51 (D) TOPOLOGY: linear 

53 (ii) MOLECULE TYPE: DNA (genomic) 

55 (ix) FEATURE: 

56 (A) NAME/KEY: CDS 
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57 (B) LOCATION: 1..1350 

59 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

61 ATG GCT GAA ATT GTT GTC TCC AAA TTT GGC GGT ACC AGC GTA GCT GAT 48 

62 Met Ala Glu He Val Val Ser Lys Phe Gly Gly Thr Ser Val Ala Asp 

63 1 5 10 15 

65 TTT GAC GCC ATG AAC CGC AGC GCT GAT ATT GTG CTT TCT GAT GCC AAC 96 

66 Phe Asp Ala Met Asn Arg Ser Ala Asp He Val Leu Ser Asp Ala Asn 

67 20 25 30 

69 GTG CGT TTA GTT GTC CTC TCG GCT TCT GCT GGT ATC ACT AAT CTG CTG 144 

70 Val Arg Leu Val Val Leu Ser Ala Ser Ala Gly He Thr Asn Leu Leu 

71 35 40 45 

73 GTC GCT TTA GCT GAA GGA CTG GAA CCT GGC GAG CGA TTC GAA AAA CTC 192 

74 Val Ala Leu Ala Glu Gly Leu Glu Pro Gly Glu Arg Phe Glu Lys Leu 

75 50 55 60 

77 GAC GCT ATC CGC AAC ATC CAG TTT GCC ATT CTG GAA CGT CTG CGT TAC 240 

78 Asp Ala He Arg Asn He Gin Phe Ala He Leu Glu Arg Leu Arg Tyr 

79 65 70 75 80 

81 CCG AAC GTT ATC CGT GAA GAG ATT GAA CGT CTG CTG GAG AAC ATT ACT 288 

82 Pro Asn Val He Arg Glu Glu He Glu Arg Leu Leu Glu Asn lie Thr 

83 85 90 95 

85 GTT CTG GCA GAA GCG GCG GCG CTG GCA ACG TCT CCG GCG CTG ACA GAT 336 

86 Val Leu Ala Glu Ala Ala Ala Leu Ala Thr Ser Pro Ala Leu Thr Asp 

87 100 105 110 

89 GAG CTG GTC AGC CAC GGC GAG CTG ATG TCG ACC CTG CTG TTT GTT GAG 384 

90 Glu Leu Val Ser His Gly Glu Leu Met Ser Thr Leu Leu Phe Val Glu 

91 115 120 125 

93 ATC CTG CGC GAA CGC GAT GTT CAG GCA CAG TGG TTT GAT GTA CGT AAA 432 

94 He Leu Arg Glu Arg Asp Val Gin Ala Gin Trp Phe Asp Val Arg Lys 

95 130 135 140 

97 GTG ATG CGT ACC AAC GAC CGA TTT GGT CGT GCA GAG CCA GAT ATA GCC 480 

98 Val Met Arg Thr Asn Asp Arg Phe Gly Arg Ala Glu Pro Asp lie Ala 

99 145 150 155 160 

101 GCG CTG GCG GAA CTG GCC GCG CTG CAG CTG CTC CCA CGT CTC AAT GAA 528 

102 Ala Leu Ala Glu Leu Ala Ala Leu Gin Leu Leu Pro Arg Leu Asn Glu 

103 165 170 175 

105 GGC TTA GTG ATC ACC CAG GGA TTT ATC GGT AGC GAA AAT AAA GGT CGT 576 

106 Gly Leu Val He Thr Gin Gly Phe He Gly Ser Glu Asn Lys Gly Arg 

107 180 185 190 

109 ACA ACG ACG CTT GGC CGT GGA GGC AGC GAT TAT ACG GCA GCC TTG CTG 624 

110 Thr Thr Thr Leu Gly Arg Gly Gly Ser Asp Tyr Thr Ala Ala Leu Leu 

111 195 200 205 

113 GCG GAG GCT TTA CAC GCA TCT CGT GTT GAT ATC TGG ACC GAC GTC CCG 672 

114 Ala Glu Ala Leu His Ala Ser Arg Val Asp He Trp Thr Asp Val Pro 

115 210 215 220 

117 GGC ATC TAC ACC ACC GAT CCA CGC GTA GTT TCC GCA GCA AAA CGC ATT 720 

118 Gly He Tyr Thr Thr Asp Pro Arg Val Val Ser Ala Ala Lys Arg He 

119 225 230 235 240 

121 GAT GAA ATC GCG TTT GCC GAA GCG GCA GAG ATG GCA ACT TTT GGT GCA 768 

122 Asp Glu He Ala Phe Ala Glu Ala Ala Glu Met Ala Thr Phe Gly Ala 
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191 (2) INFORMATION FOR SEQ ID NO: 3: 

193 (i) SEQUENCE CHARACTERISTICS: 

194 (A) LENGTH: 36 base pairs 

195 (B) TYPE: nucleic acid 

196 (C) STRANDEDNESS: single 

197 (D) TOPOLOGY: linear 

199 (ii) MOLECULE TYPE: DNA (genomic) 

201 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

203 GTACCGCCAA ATTTGGAGAC AACAATTTCA GCCATG 36 

205 (2) INFORMATION FOR SEQ ID NO: 4: 

207 (i) SEQUENCE CHARACTERISTICS: 

208 (A) LENGTH: 48 base pairs 

209 (B) TYPE: nucleic acid 

210 (C) STRANDEDNESS: single 

211 (D) TOPOLOGY: linear 

213 (ii) MOLECULE TYPE: DNA (genomic) 

215 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

217 CCCGGGCCAT GGCTACAGGT TTAACAGCTA AGACCGGAGT AGAGCACT 48 

219 (2) INFORMATION FOR SEQ ID NO: 5: 

221 (i) SEQUENCE CHARACTERISTICS: 

222 (A) LENGTH: 37 base pairs 

223 (B) TYPE: nucleic acid 

224 (C) STRANDEDNESS: single 

225 (D) TOPOLOGY: linear 

227 (ii) MOLECULE TYPE: DNA (genomic) 

229 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

231 GATATCGAAT TCTCATTATA GAACTCCAGC TTTTTTC 37 

233 (2) INFORMATION FOR SEQ ID NO: 6: 

235 (i) SEQUENCE CHARACTERISTICS: 

236 (A) LENGTH: 917 base pairs 

237 (B) TYPE: nucleic acid 

238 (C) STRANDEDNESS: single 

239 (D) TOPOLOGY: linear 

241 (ii) MOLECULE TYPE: DNA (genomic) 

243 (ix) FEATURE: 

244 (A) NAME/KEY: CDS 

245 (B) LOCATION: 3 . . 911 

247 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

249 CC ATG GCT ACA GGT TTA ACA GCT AAG ACC GGA GTA GAG CAC TTC GGC 47 

250 Met Ala Thr Gly Leu Thr Ala Lys Thr Gly Val Glu His Phe Gly 

251 1 ~ 5 10 15 

253 ACC GTT GGA GTA GCA ATG GTT ACT CCA TTC ACG GAA TCC GGA GAC ATC 95 

254 Thr Val Gly Val Ala Met Val Thr Pro Phe Thr Glu Ser Gly Asp lie 

255 20 25 30 

257 GAT ATC GCT GCT GGC CGC GAA GTC GCG GCT TAT TTG GTT GAT AAG GGC 143 

258 Asp He Ala Ala Gly Arg Glu Val Ala Ala Tyr Leu Val Asp Lys Gly 

259 35 40 45 

261 TTG GAT TCT TTG GTT CTC GCG GGC ACC ACT GGT GAA TCC CCA ACG ACA 191 

262 Leu Asp Ser Leu Val Leu Ala Gly Thr Thr Gly Glu Ser Pro Thr Thr 
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1 111 


JCl 


rim 


Pl 11 
bill 


Pl 11 

UlU 


Pl V 

vji y 


A cn 
no^ 


T.on 
lieU 


Va 1 

Val 


Arg Ala 




OK) / 




ZZd 










7 ^ n 
z 0 u 










7 ^S 












j uy 


ppp 
LLL 


P A A 

LAA 


A TP 

AIL 


A A P 

AAL 


ppp 

LLL 


AAA 

AAA 


PT A 
LI A 


TP A 
1 Ln 


PPP 
LLL 


PTP 
LI L 


PT A 
Lin 


PPT 
LL -L 


PPP 
LLL 


PAA 


GGT 


CGC 


767 




Arg 


p 1 11 
LIU 


lie 


Asn 


Ala 
Ala 


Lys 


lieU 


Co T' 

Del 


rl L> 


T.on 
liell 


Va 1 
Val 


Ala 
nia 


Ala 
nia 


Pl n 

ulll 


Gly Arg 




111 
Jll 


Z4U 










Z 4 D 










7 sn 

Z J u 










255 




j X j 


1 1L 


GGT 


GGA 


GTC 


APP 

ALL 


mmp 
1 1L 


GCA 


AAA 


PPT 
LL 1 


PPT 
uL 1 


PTP 


PPT 


PTP 


PAP 
Lno 


GGC 


ATC 


815 




Leu 


Gly 


Gly 


Val 


Ser 


Leu 


Ala 


Lys 


Ala 


Ala 
nia 


T.on 
liell 


A Trr 
nl y 


T.on 


Pi Tl 
ulll 


Gly 


He 




"*1 5 










260 










265 










270 






317 


AAC 


GTA 


GGA 


GAT 


CCT 


CGA 


CTT 


CCA 


ATT 


ATG 


GCT 


CCA 


AAT 


GAG 


CAG 


GAA 


863 


318 


Asn 


Val 


Gly 


Asp 


Pro 


Arg 


Leu 


Pro 


He 


Met 


Ala 


Pro 


Asn 


Glu 


Gin 


Glu 




319 








275 










280 










285 








321 


CTT 


GAG 


GCT 


CTC 


CGA 


GAA 


GAC 


ATG 


AAA 


AAA 


GCT 


GGA 


GTT 


CTA 


TAA 


TGAGAATTC 


322 


Leu 


Glu 


Ala 


Leu 


Arg 


Glu 


Asp 


Met 


Lys 


Lys 


Ala 


Gly 


Val 


Leu 


* 
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290 










295 










300 










325 


(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 7 
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(i) 


SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 22 base pairs 
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RAW SEQUENCE LISTING ERROR SUMMARY 
PATENT APPLICATION: US/10/023 , 066A 



DATE: 05/13/2002 
TIME: 09:31:21 



Input Set : A:\BB1037 US DIV CORRECTED SEQ LISTING . txt 
Output Set: N:\CRF3\05102002\J023066A.raw 



Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:93; N Pos . 557,558,559,560,561 
Seq#:102; N POS . 97 
Seq#:103; N Pos . 253,289,293 
Seq#:104; Xaa Pos. 32 
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VERIFICATION SUMMARY DATE: 05/13/2002 

PATENT APPLICATION: US/10/023 f 066A TIME: 09:31:21 

Input Set : A:\BB1037 US DIV CORRECTED SEQ LISTING.txt 
Output Set: N:\CRF3\05102002\J023066A.raw 

L:31 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER:] 
L:32 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:] 
L:175 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:775 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 29 
L:842 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 31 
L:1379 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 53 
L:2635 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:104 after pos.:16 
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